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Summary 

The  minimum  discrimination  information  theorem  yields 
minimum  discrimination  information  estimates  as  members  of 
an  exponential  family  which  can  be  expressed  in  log-linear 
additive  form.  We  illustrate  in  terms  of  a  particular 
contingency  table  the  relations  implied  by  hypotheses  and 
subhypotheses  of  interest  on  the  values  of  the  natural  para¬ 
meters  and  random  variables  of  the  exponential  family,  the 
related  moment  parameters,  and  the  estimates  of  cell  entries. 
Computations  are  carried  out  using  the  Deming-Stephan 
iterative  algorithm  and  its  extension  by  the  generalised 
iterative  scaling  procedure  of  Darroch  and  Ratcliff. 

1.  Introduction 


It  is  a  consequence  of  the  minimum  discrimination 
information  theorem,  that  minimum  discrimination  in¬ 
formation  distributions  are  formulated  as  members  of  an 
exponential  family  [3],  111].  For  applications  to  the  analysis 
of  contingency  tables  it  is  useful  to  express  the  exponential 
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family  as  a  log-linear  additive  model  [10]  ,  [13]  .  We 
propose  to  illustrate  in  terms  of  a  particular  set  of  data 
the  relations  implied  by  hypotheses  and  subhypotheses  of 
interest  on  the  values  of  the  natural  parameters  and  random 
variables  of  the  exponential  family,  the  related  moment 
parameters,  and  the  estimates  of  the  cell  entries. 

Since  our  estimates  in  this  discussion  are  constrained 
to  satisfy  certain  linear  relations  based  on  observed  values, 
the  minimum  discrimination  information  estimates  are  maximum 
likelihood  estimates  and  the  associated  minimum  discrimination 
information  statistics  are  log-likelihood  ratio  statistics 
[3],  [5],  [12],  [13].  Further  references  to  the  literature 
on  the  log-linear  model  may  be  found  in  papers  by  Bishop  [2], 
Dempster  [6],  Gokhale  [7],  Ku  et.  al.  [10],  Plackett  [14]. 

The  particular  data  we  shall  use  were  first  presented  by 
Cochran  [4],  are  given  in  table  1,  and  have  been  examined  from 
various  points  of  view  [1],  [2],  [7],  [8],  [10],  [15].  It 

should  be  noted  that  the  use  of  the  data  is  primarily  as  a 
vehicle  to  illustrate  the  procedures  and  concepts  rather  than 
the  reanalysis  of  the  data  per  se.  Computations  were  carried 
out  using  a  general  computer  program  for  the  analysis  of 
contingency  tables  including  the  Deming -Stephan  iterative 
algorithm  and  its  extension  by  the  generalised  iterative 
scaling  procedure  of  Darroch  and  Ratcliff  [5]  . 
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2 .  The  Hypothesis  and  Subhypotheses 

As  stated  by  Grizzle  [8,  p.  379],  "It  is  desired  to  compare 
the  mothers  of  Baltimore  school  children  who  have  been  referred 
by  their  teachers  as  presenting  behavioral  problems  to  mothers 
of  a  comparable  group  of  control  children.  For  each  mother  it 
is  recorded  whether  she  had  suffered  any  infant  losses  previous 
to  the  child  in  the  study."  The  hypotheses  considered  by 
Grizzle  may  be  stated  as  [1],  [8]: 

Case  1.  "The  test  of  the  hypothesis  of  no  interaction" 
or  logit  additivity. 

Then  "to  test  hypotheses  about  the  parameters  in  the 
model . " 

Case  2.  "Is  there  a  difference  between  problems  and 
controls?" 

Case  3.  Is  there  "equality  of  birth-order  effects?" 

Case  4.  a)  "Is  the  effect  of  birth-order  linear?" 
b)  "Does  it  require  a  second-degree 
polynomial  to  describe  the  effect?" 

It  was  shown  that  the  null  hypothesis  of  logit  additivity 
of  case  1  is  satisfied  [8]  .  Maximum- likelihood  estimates  for 
the  subhypotheses  of  cases  2,3,4  were  obtained  in  terms  of  linear 
contrasts  of  the  logits,  but  did  not  include  a  specific  restraint 

of  logit  additivity  [8].  Berkson  [1]  gave  the  maximum-likelihood 

.  2 
estimates,  as  well  as  minimum  logit  x  estimates  under  the  same 
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restraints  of  the  linear  contrasts  of  the  logits.  The  maximum- 
likelihood  estimates  are  given  in  table  2.  The  resulting 
maximum- likelihood  estimates  for  cases  2,3,4  consequently  do 
not  satisfy  logit  additivity  as  shown  in  (2.1)-(2.4).  Similar 
results  also  follow  for  the  minimum  logit  x  estimates .  The 
procedure  we  shall  present  derives  maximum-likelihood  estimates 
for  the  various  cases  that  do  satisfy  logit  additivity. 


3 .  The  log-linear  Representation 


In  accordance  with  the  minimum  discrimination  infor¬ 
mation  theorem  [11,  p.  38],  [13]  a  log-linear  representation 
for  the  observed  values  x(ijk)  of  table  1  for  the  complete 
set  of  marginals,  including  observed  cell  values  (as  three- 
way  marginals),  is  given  analytically  by  (3.1)  where  iT(ijk) 
is  the  uniform  distribution  and  n  is  the  total  number  of  ob¬ 
servations,  and  graphically  in  table  3a.  The  values  in  column 
1  of  table  3a  are  essentially  those  of  a  normalizing  constant 
(the  negative  of  the  logarithm  of  the  moment-generating  func¬ 
tion)  .  .The  values  in  the  other  columns  of  table  3a  are  those 
of  a  set  of  linearly  independent  functions  T(ijk)  whose  mean 
values  are  various  marginals .  Note  that 


Tli  (i jk)  =  Tj(ijk)Tjdjk)  ,etc.;T*J*(ijk) 


T*(ijk)Tj(ijk)T*(ijk) ,etc, 


1 


i  j 


where  the  T(ijk)  are  indicator  functions  of  the  various  mar¬ 
ginals  [13]. 


,f***MW 


Case  2 


,.17.812  _  0_12 .188  _  »_  «  „„„ 
**84 !  188  "  ^5l!l82  “  *n  0*980 


„_23. 812 

„_24 . 812 
^24.188 


=  ln  0,843 
- tn  1-319 


Case  3 


,.29 .925  „_19 . 925 

^ 72. 075  "  ^TTTSTT  “  tn  0,918 

,.22.471  _  „_12 . 471  _ 

^44  !  529  1  3 5  I  5 2 $  “  *n  1,357 

**16,396  ^nI5T US'  ~  *n  2,805 


(2.2) 


Case  4a 


,.27 . 535  „_17 . 535  _ 

ta,y4 1 465  ^46  1 465  =  0,980 

=  In  1.189 

**W. 535  "  £nKT.535  "  tn  3,113 


(2.3) 


Case  4b 


V-  21,191  _  ,.11.191  _  p  1  ™ 

^'36.809  ^n$2.309  “  *n  1,237 

,.23.618  ,_13 . 618  _  ,  ^nr  ,  ^  ,\ 

^43!  382  “  382  *n  1*295  (2.4) 

p  28 . 191  _  ,.15.191  _  „n  .  q . c 
^20.809  1  21 1  809  “  1,945 


-  L  +  +  t£xJ  u.jio 

+  T^Tj(ijk)  +  TjTjttjk)  +  T^T^j(ijk) 

’  ..  (3,1> 

+  T^Tgdjk)  +  t12T12  ^jk)  +  TnTil(ijk) 

+  ^2T12<«k>  +  'uMnliiW  +  ^U2T112(iik» 


--  >-*  — - r-n.  A.i-< 


From  table  3a  and  the  representation  in  (3.1)  we  may  express 
the  logits  over  losses/no  losses  in  the  observed  contingency  table 


parametrically  as  in  (3.2).  The  values  of  the  effect  and  inter¬ 
action  (tau)  parameters  for  x(ijk)  are  given  in  table  5.  Let  us 
now  consider  the  implication  of  the  hypotheses. 


In 


x(lll) 
x (121) 


Tij  +  Tjk 
T11  T11 


+ 


x (122) 


T3  +  X13 
1  11 


x (113) 
(123) 


(3.2) 


(211)  _  j 
tox(22l)  T1 


+  T 


jk 

11 


x(212i  _  j 
^  x (222)  T1 


+  T 


jk 

12 


x (213) 
x(223) 


j 


T 


1 


Case  1 

For  logit  additivity  (no  second-order  interaction)  the  es¬ 
timate  must  satisfy  x*Jk=T*jk=0,  in  order  that  the  difference 

between  corresponding  logits  for  problems  and  controls  (i=l,2) 
be  the  same.  Its  representati  ->n  is  given  by  columns  1-10  of 
table  3a.  The  minimum  discrimination  information  estimates  are 
thus  constrained  to  have  the  same  two-way  marginals  as  the  observed 

table  (implying  all  lower  order  marginals)  and  may  be  obtained  by 
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the  Deming-Stephan  iterative  algorithm.  This  procedure  was 
illustrated  for  this  data  by  Ku  et  al  [10,  p.  59].  If  we  denote 

the  estimates  by  fitting  the  two-way  marginals,  that  is, 
satisfying  logit  additivity, as  (i jk) ,  then  the  corresponding 

logits  may  be  expressed  parametrically  as  in  (3.3). 


x*(lll) 

x*(112) 

*nxjrmy 


In 


x*(113) 

*ja«y 


xj  (211) 

*nx*"[22i) 

x*(212) 

lnxflJTZT 


In 


x^  (213) 


x3  +  Tij  +  xjk 
1  11  T11 


x3  +  t13 
1  11 


x  3  +  xiD 
1  T11 


+  Tjk 
T11 


+  Tjk 
T12 


y  + 


=  y  +  a- 


+  B« 


=  T 3 


t? 


+  Tjk 
T12 


V  +  <*i 


y  +  a2  +  B^ 


=  1J  +  a. 


V  +  a. 


+  0. 


+  B. 


+  B. 


(3.3) 


We  remark  that  the  numerical  values  of  the  tau  parameters 
in  (3.3)  are  of  course  not  the  same  as  those  in  (3.2),  even 
though  we  use  the  same  notation  for  simplicity.  We  may  also 
consider  a  parametrization  in  terms  of  orthogonal  polynomials 
[9,  p.  359]  as  in  (3.4)  assuming  equal  spacing  of  birth-order 
effects  [8] . 

Grizzle  used  the  parametrization  of  (3.3)  in  y,a^,0j  with 


ai+a2=0'  ei+e2+B3=0  [8] * 
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Tj  +  Tij  +  Tj* 
T1  T11  T11 


al  “  bl  +  b2 


Tj  +  Tij  +  Tjk 

T1  T11  t12 


al  -  2b2 


Tj  +  Tlj 
T1  T11 

T1  T11 


ai  +  bl  +  b2 


a2  -  bi  +  b2 


(3.4) 


t12 


a2  '  2b2 


a2  +  fai  +  b2 


To  test  the  null  hypothesis  of  case  1  we  compute  the 


statistic 


2l(x:x*)  =  2£££x(ijk)  £n(x(ijk)/xj (ijk) )  =  0.853,  2D.F. 

The  null  hypothesis  of  logit  additivity  is  accepted.  Note 
that  this  is  derated  as  Model  (1)  by  Bishop  [2,  p.  556]. 

We  see  from  (3.3)  and  (3.4)  that 


V  +  a2  +  83  =  a2  +  b^  +  b2 


al  “  a2  =  al  a2 


Tjk  - 
T11 


8q  -  63  =  -2bL 


(3.5) 


-  83  =  -bx  -  3b2 


2t12  “  61  “  262  +  63  =  6b2 
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The  values  of  x^(ijk)  are  given  in  table  4  and  the  common 
logit  difference  is  The  values  of  the  parameters  for 

x*  are  given  in  table  5. 

Case  2 


For  the  subhypothesis  that  there  is  no  difference  in 
the  logits  between  problems  i=l  and  controls  i~2  in  the  model 
of  logit  additivity  we  must  have 

T11  =  ai  “  a2  =  al  “  a2  =  0 

in  (3. 3) -(3. 5),  that  is,  ti.i  corresponding  estimate  x*(ijk) 


must  satisfy 


T 


ijk  = 
111 


0. 


The  representation  for  x*(ijk)  is  given  by  columns  1-5,  7-10 

of  table  3a.  The  estimate  x*(ijk)  must  thus  have  the  two-way 

marginals  x*(i.k)  =  x(i.k),  x*(.jk)  =  x(.jk)  and  the  implied 

lower  ord^r  marginals.  In  this  case  the  estimate  is  explicitly 
given  by 

x*(ijk)  =  x(i.k)x(.  jk)/x(.  .k) . 

Note  that  this  is  denoted  as  PA  Model  (2)  by  Bishop  [2,  p.  556]. 
The  numerical  values  of  x*(ijk)  are  given  in  table  4  and  the 

common  logit  difference  in  this  case  is  zero.  The  parameter 
values  for  x*  are  given  in  table  5. 


r. 
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Case  3 


For  the  subhypothesis  that  there  is  equality  of  birth- 
order  effects  k=l,2,3  in  the  model  of  logit  additivity  we  must 

have  P1=^2=^3  or  Tll=T12=0  °r  bl=b2=°  in  (3,3)“(3,5)'  that  is, 

the  corresponding  estimate  x*(ijk)  must  satisfy 

ijk  =  ijk  =  jk  =  jk  = 

111  112  11  12  0  * 

The  representation  for  x*(ijk)  is  given  by  columns  1-8  of  table 
3a.  The  estimate  x*(ijk)  must  th  have  the  two-way  marginals 
x*(ij.)=x(ij.) ,x*(i.k)=x(i.k)  and  the  implied  lower  order  mar¬ 
ginals.  In  this  case  the  estimate  is  explicitly  given  by  (a 
version  of  Bishop's  PA  model  [2,p.  556]) 
x*(ijk)  =  x(ij  .  )x(i.k)/x(i. . )  . 

The  numerical  values  of  x*(ij) )  are  given  in  table  4.  The 
common  xogit  difference  is  t*3.  The  parameter  values  for  x£ 

are  given  in  table  5. 


Case  4a 

ior  the  null  subhypothesis  Ko^  that  the  linear  component  is 

•j 

zero  in  the  model  of  logit  additivity  we  must  have  3 1^3 3  or  =  0 
or  b^  =  0  in  (3. 3) -(3. 5),  that  is,  the  corresponding  estimate  x*(ijk) 


- --  .--i .-j 
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must  satisfy  =  T^2  =  T11  =  T^e  representation  for 

x* (ijk)  is  given  by  columns  1-8,10  of  table  3a.  The  estimate 

x*(ijk)  must  thus  satisfy  the  restraints 

xg  (i  j  • )  =  x(ij  . )  ,  x*  (i.k)  =  x(i.k),  x*(.j2)  =  x(.j2) 

on  the  two-way  marginals  and  the  implied  lower  order  marginals. 

The  values  of  x*(ijk)  are  obtained  by  the  iteration  (cf.  Gokhale  [7]) 

x(3n+1Ujk) 

«<3n+2Ujk) 

x<3n+3Ujk> 
x<3n+3Ujk) 
k=l,3. 

The  numerical  values  of  x*(ijk)  are  given  in  table  4.  The 
common  logit  difference  is  t^.  The  parameter  values  for 
x*  are  given  in  table  5. 

Case  4b 

For  the  subhypothesis  Ho^  that  the  quadratic  component  is 
zero  in  the  model  of  logit  additivity,  we  must  have  or 


^t12=t11  or  (3.3) -(3.5 ),  that  is,  the  corresponding 

estimate  xj(ijk)  must  satisfy  x^jk.=  x^jk  =  °»  2t12  =  T11  =  T^k 

T  )c 

where  we  use  the  parameter  xJ  to  avoid  confusion.  To  determine 
the  restraints  and  estimation  procedure  for  x£(ijk)  we  note  that 

the  requirement  2x^k  =  =  x^k  implies  that  in  the  log-linear 

regression  (3.1)  there  appears  a  term  of  the  form 

...xjk(T^(ijk)  +  §-t12  (ijk) )  *  -  -  (3.7) 

so  that  the  corresponding  revised  graphic  table  for  x*(ijk) 

appears  as  in  table  3b.  The  estimate  x^'(ijk)  must  satisfy 

the  restraints 

x*(ij.)  =  x (ij . ) ,  x*(i.k)  =  x(i.k) 

xj(.jl)  +  xh<*32)/2  =  x(.jl)  +  x(.j2)/2 

and  those  implied  on  the  lower  order  marginals.  The  values  of 
x£(ijk)  are  determined  by  the  generalised  iterative  scaling 

procedure  of  Darroch  and  Ratcliff  [5]  (see  appendix) 


x(3n+1Ujk)  -  <x(ij.)/x(3nUj.))x(3nUjk) 
x(3n+2^ijk)  =  (x(i>k)/x(3n+l^i  k))x(3n+l^ijk) 

(3.8) 

X(3n+3\ijk)  = 


( 


( 


h3  ) 


(ijk) 


a. (ijk)  ,  h2 


a- (ijk) 

\ 


(3n+21. 


where  the  values  for  ar(ijk) 
6.  The  numerical  values  of 
common  logit  deference  is  t 


/  hr,  r=l ,2,3  are  given  in  table 
xh*A3k)  are  given  in  table  4.  The 
11*  The  parameter  values  for  x* 


are  given  in  table  5. 


Remarks 


It  is  of  interest  to  note  that  the  procedure  used  by  Grizzle 
[8]  which  leads  to  the  estimates  *J,x,*,xJ,x*  for  case  2, 3, 4a, lb 

respectively,  given  in  table  2,  is  in  fact  equivalent  to  using 
the  parametric  tau  representation  in  (3.2)  rather  than  that  in 
(3.3).  The  contrasts  in  terms  of  the  logits  thus  imply  the 
following  relations  among  the  tau  parameters. 

c  ase  2 


Contrast :  fn^il  +  x(212)  .  x(2l; 

*U21  x(122)  x  (123)  x(221) +inx(222)  ^^22; 


that  is, 


3  xJ+3  J+  t  Tj  jk=3  Tj+  Tjk+  T  jk  f 


3T^+Xijk+Tijk 

11  HI  112  * 


Contrast : 


Case  3 
x(lll) 


t3+  ttJ+2  t3^+  Tr:t=2  t3  +  t* 3+2  T3*+Ti  jk«2  Tj+Ti  j  , 
AA  11  111  1  li  12  112  1  11 


2TU+Tiu=°-  2Tis+  T^2=°- 


that  is 


-14- 


Case  4a 

„  ,  x(lll)  „  x  (211)  „  x  (113)  „  x  (213) 

Contrast:  £n^y2T)  ^TOTTT^^JTnY 


2T0+Tij+2T3k+Tijk=2Tj+Tij 
T1  T11  T11  Tlll  T1  Tll' 


that  is. 


2t jk+ti jk=0 
11  111 


Case  4b 


that  is , 


Contrast:  4£n|||^|.=2  <*n£{^+£n£{§A|| 

',T1+2tU+2T11+T11X=2  <2t1*TU+2t12+TU2>  ' 


2T3k+Tijks=4  jk  2  ijk 
11  111  12  112* 


It  may  be  verified  that  the  restraints,  implied  by  the  relations 
among  the  taus,  which  the  estimates  must  satisfy,  are  indeed  satis¬ 
fied  by  those  given  in  table  2.  The  restraints  are: 

Case  2:  x*(i.k)  =  x(i.k),  xj(.jk)  =  x(.jk), 

2x* (ijl)  -  x* (i j 2  )  -  x* (ij 3)  -  2x(ijl)  -  x(ij2)  -  x(ij3), 

-  x*(ijl)  +  2x*(ij2)  -  x*(ij3)  =  -x(ijl)  +  2x(ij2)  -  x(ij3); 


Case  3 


:  x* (i j . )  =  x (i j . ) ,  x*(i.k)  =  x(i.k) 


x£(ljk)  -  x£(2jk)  =  x(ljk)  -  x (2 jk) ; 

Case  4a:  x*(ij.)  =  x(ij.),  x*(i.k)  =  x(i.k),  x*(.j2)  =  x(.j2), 
>*(ij2)  -  x(ij2),  x*  (1  jk)  -  x*  (2  jk)  =  x(ljk)  -  x(2jk); 
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Case  4b:  x£(ij.)  =  x(ij.),  x£(i.k)  =  x(i.k), 

x£(ljk)  -  x£(2jk)  =  x(ljk)  -  x  (2 jk)  , 

2x*(ijl)  +  x£(ij2)  =  2x  (i  j  1)  +  x(ij2), 
x£(ij2)  +  2x|(ij3)  =  x(ij2)  +  2x(ij3). 

For  convenience  we  summarize  the  conditions  on  the  tau 
parameters  associated  with  each  of  the  estimates. 


Case 

1: 

ijk  =  ijk 
2'  TU1  t112 

=  o, 

Case 

2: 

x*  ij  ..  ijk  _ 
xe'  T11  Tlll 

ijk  -  0 

Case 

2: 

x*  3ij  +  ijk 
xa'  JT11  +  Tlll 

+  t112  =  <>, 

Case 

3: 

x*  Tjk  _  jk  _ 
f'  T11  t12  ~ 

Tm  =  -  o. 

Case 

3: 

x,*  2rjk  +  t  1  k 
Xb'  /T11  +  Tlll 

=  o,  2Tjk  ♦  xjjk  =  0, 

Case 

4a : 

x*  _  jk  _  ijk  _ 
g'  T11  Tlll  ~ 

Tljk  =  0 
t112  U' 

Case 

4a : 

X*  2rjk  + 

xc'  4JT11  Tlll 

=  0  Tijk 

Ur  T112r 

Case 

4b : 

x*  2Tjk  =  Tjk  - 

n'  /t12  t11 

Jk  ijk  ijk  =  Q 

T  '  Tlll  t112 

Case 

4b : 

x*  2Tjk  +  Tijk 

Xd'  ^Tll  +  Tlll 

=  4T jk  +  2xijk 
t12  ^t112 ' 

As  we  shall  see  in  section  4,  we  accept  the  subhypothesis 
of  case  2  that 

x* jk=Tijk_Tij_Q . 

Ill  112  11 


It  seems  reasonable  therefore  to  reexamine  the  effects  of  birth- 
order  within  the  combined  model  of  case  1  and  case  2. 


Case  5 


For  the  subhypothesis  of  case  3  within  case  1  and  case 
the  corresponding  estimate  x^(ijk)  must  satisfy 


O 

*  « 


ijk  ijk  ij  jk  jk. 

T  111*  112^  11  T11  t12  u* 


The  representation  for  x^(ijk)  is  given  by  columns  1-5, 7, 8  of 

table  3a.  The  estimate  is  explicitly  given  by 
x*(ijk)  =  x(. j.)x(i.k)/n. 

The  numerical  values  of  x^(ijk)  are  given  in  table  4.  The  common 
logit  difference  is  zei'o.  The  parameter  values  for  x*(ijk)  are 
given  in  table  5. 


Case  6a 


For  the  subhypothesis  of  case  4a  within  case  1  and  case  2, 
the  corresponding  estimate  x*(ijk)  must  satisfy 


ijk  ijk  ij  jk=0 
Tlll  x112  T11  T11  u 


The  representation  for  x*(ijk)  is  given  by  columns  1-5,7,8,10 

n 

of  table  3a.  The  values  of  x*(ijk)  are  obtained  by  the  iter¬ 
ation  (3.6)  except  that  the  first  cycle  is  replaced  by 

x(3n+1Ujk)  =  (x(.j.)/x(3n\.j.))x(3nUjk)  . 

The  numerical  values  of  x*(ijk)  are  given  in  table  4.  The 

common  logit  difference  is  zero.  The  parameter  values  for 
x*(ijk)  are  given  in  table  5. 


Case  6b 


For  the  subhypothesis  of  case  4b  within  case  1  and  case  2 , 


the  corresponding  estimate  xMijk)  must  satisfy 


ijk  ijk  ij  2t jk  jk  jk , 
111  112  X11  1  '  *  12  11 


The  representation  for  x*(ijk)  is  given  by  columns  ?.-5,7-9  of 


table  3b.  The  values  of  x*  (ijk)  are  determined  by  i:he  iteration 


(3.8)  except  that  the  firjt  cycle  is  replaced  by 
x(3n+1Ujk)  =  (x(.  j.)/x(3n\.  j.))x(3nUjk)  • 


The  numerical  values  of  x*(ijk)  are  given  in  table  4.  The  com¬ 


mon  logit  difference  is  zero.  The  parameter  values  for  x*(ijk) 


are  given  in  table  5. 


4.  The  Analysis  of  Information 


To  test  the  various  hypotheses  and  their  associated 
estimates  we  set  up  analysis  of  information  tables  showing 
interaction  type  measures  and  effect  type  measures  [10]  ,  [13]  . 
We  recall  that  interaction  type  measures  compare  the  original 
table  with  an  estimated  table  and  test  a  hypothesis  that  the 
values  of  the  tau  parameters  of  the  original  table  satisfy  the 
restraints  on  the  tau  parameters  specifying  the  estimated 
table.  This  value  may  also  be  interpreted  as  a  measure  of  the 
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goodness -of -fit  of  the  estimate  obtained  by  fitting  the 
implied  relations  on  the  estimated  cell  values.  Effect 
type  measures  compare  two  estimated  tables  and  test  a 
hypothesis  that  the  additional  parameters  in  one  of  the 
estimates  differ  from  their  zero  values  in  the  other  es¬ 
timate.  This  value  may  also  be  interpreted  as  a  measure 
of  the  effect  of  the  additional  restraints  on  the  estimate 
in  the  fitting  procedure. 

We  first  compare  the  estimates  within  the  model  of 
logit  additivity  with  those  obtained  by  using  linear  con¬ 
trasts  of  the  logits.  In  the  analysis  of  information-I  we 

also  include  the  values  of  the  Pearson 
2  2 

X  mlll (x(ijk) -x*(ijk) )  /x* (ijk) ,  for  r=a,b,c,d,  given  by  Grizzle  [8]. 

We  note  that  x*  and  x*,  x£  and  x$,  x*  and  x*,  x*  and 
a  e  f  c  g  d 

x£  do  not  differ  significantly  from  one  another.  This  is 

not  surprising  since  from  the  value  of  2I(x:x£)  we  inferred 

that  the  values  of  and  f°r  x(ij*)  do  not  differ 

significantly  from  “  0*  However,  though  the 

estimates  x * , x£ , x* , x£  and  the  estimates  xg»xf'xg'xh  satisfy 

the  linear  contrasts  for  the  logits  only  the  latter  four 
satisfy  logit  additivity.  We  note  that  xe'xf'xg'xh  are 

estimated  using  fewer  tau  parameters  than  x? * x* , x* , x2  re- 

tt  D  C  Q 

spectively . 

Within  the  model  of  logit  additivity  we  have  analysis 
of  information-II  and  within  case  1  and  case  2  analysis  of 


information-III . 
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Analys 

is  of  Information-I 

Component  due  to 

Information 

2 

X 

D.F. 

Case  2 

Tij_Tijk_  ijk_n 

11  t111"t112“u 

21 (x  :x*)  =  3.154 

3 

Difference 

21 (x* :x*)=  0.654 

d  6 

2 

3xij+Tijk+  ijk= 

JT11  Tlll  t112  u 

2  T  (>.  :x*)  =  2.500 

d 

2.475 

1 

Ca  je  3 

Tjk=  jk=  ijk= tijk=Q 

T11  t12  t111  t112  u 

21  (x  :x*)=28.199 

4 

Difference 

2I(x*:x*)=  3.724 

2 

**  r jk+Tijk  0 

11+T111  U/ 

2x^k+Ti^k=0 

12  T112  U 

21  (x  :x*)=24.475 

24.239 

2 

Case  4a 

/ 

Tjk=  ijk_  ijk_Q 

11  111  112  u 

21 (x  :x*) =25 . 257 

g 

3 

Difference 

21  (x* :x*)=  3.932 
c  g' 

2 

2T3k+Tijk=0  Tijk 
*11 111  '  112 

21 (x  :x*) =21 . 325 

C 

22.765 

1 

Case  4b 

2Tjk=Tjk  jk 
*T12  t11  t  ' 

Tijk  ijk 

Tlll  t112  u 

21  (x  :x*)=  2.131 

3 

Difference 

2I(x*:x*)=  0.670 

2 

2xjk  ijk=4  jk  ijk 

*T11  Tlll  *T12+ZT112 

2I(x  :x£)  =  1.461 

1.478 

1 
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Analysis  of  In formation- I I 


Component  due  to 

Information 

x2  D.F. 

Case  2 

Tij=Tijk=  ijk=0 

T11  111  x112  u 

21 (x  :x|)  =  3.154 

3 

Conditional  effect  of 

2l(x*:xJ)=  2.301 

1 

Tijk_ijk 

Tlll  4 112_u 

21 (x  :x*)=  0.853 

0.851  2 

Case 

3  and  4a 

Tjk  jk  ijk  ijk=Q 

T11  x12  Tlll  t112  u 
or  (b1=0,  b2=0) 

21  (x  :xf.)=28.199 

4 

Conditional  effect  of  t^2 

or  quadratic  component 

21 (x* :xi) =  2.942 

9  i 

1 

21 (x  :x*)=25  257 

9 

3 

Effect  of  linear  component 
given  quadratic  component 

2I(x*:x*)=24.404 
c  g 

1 

Tijk  ijk 

Tlll  t112  u 

21 (x  :x*)=  0.85! 

2 

Case 

3  and  4b 

21 (x  :x*)=28.199 

4 

Effect  of  linear  component 

21 (x^:x|)*26 .068 

1 

2TjkeTjk  jk  ijk  ijk=0 
2T12  T11  T  ,T111  T112  u  °r 
(b2=0) 

21 (x  :x*)=  2.131 

3 

Effect  of  quadratic  component 
given  linear  component 

2I(x*:x*)=  1.278 

1 

2 :(x  :x*)  =  0.853 

2 
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Analysis  of  Information-Ill 


ComDonent  due  to 

Information 

D.F. 

Case  5  and 

6a 

Tij_Tjk_Tjk_Tijk  ?  k 

11"  11“  12“  111"  l_2“u 

21 (x  ; x* ) =29 . 831 
m 

5 

Conditional  effect  of  rjj 
or  quadratic  component 

2I(x*:x*)=  2.904 
n  m 

1 

Tij=Tjk=  ijk=  ijk= 

T11  T11  Tlll  t112  u 

21  (x  :x*)=26.927 
n 

4 

Effect  of  linear  component 
given  quadratic  component 

2l(x*:x*)=23.773 
e  n 

1 

ij  ijk  ijk 

T11  Tlll  t112  u 

21  (x  :x*)=  3.154 
e 

3 

Case  5  and 

6b 

ij=Tjk  jk  ijk  ijk=Q 

T11  T11  x12  t111  t112  u 

21 (x  :x*) =29 . 831 
m 

5 

Linear  component 

21 (x* :x*) =25 . 399 

P  n 

1 

Tii=Tiii=Tii2=° 

21  (x  :x*)=  4.432 

P 

4 

Quadratic  component 
given  linear 

21 (x* :x*)=  1.278 
e  p 

1 

T 


ijijk  ijk=Q 
11  Tlll  x112  u 


21 (x  :x*)  =  3.154 
e 


3 
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We  infer  that  there  is  no  difference  between  problems 
and  controls.  The  effect  of  birth-o.der  is  not  homogeneous 
and  the  logits  may  be  expressed  by  a  linear  regression 
assuming  equal  spacing  of  the  effects. 

Within  the  model  of  logit  additivity  we  may  summarize 
the  analysis  in  the  following  form. 


Information 

Source _ (log-likelihood  ratio)  D.F. 


Problems  vs  Controls 

3 

.154 

3 

Birth-order  effects 

28 

.199 

4 

Linear 

26.068 

1 

Quadratic 

2.942 

1 

Quadratic 

(conditional) 

1.278 

1 

Linear 

(conditional) 

24.404 

1 

Interaction 

0.853 

2 

Interaction 

0.853 

2 

Within  the  model  of  case  1  and  case  2  we  may  summarize 
the  analysis  in  the  following  form 


Source 


Information 

(log-likelihood  ratio) _ D.F. 


Birth-order  effects  and 

Problems  vs  Controls  29.831  5 


Linear 


25.399  l  Quadratic 


2.904  1 


Quadratic 

(conditional) 

Conditional 
effect  of 


(Pro*  lems  vs 
Con  crols) 


1.278  l 


Linear 

(conditional)  23.773  1 


2.301 


1 


Interaction 


0.853 


2 
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Appendix 


We  give  in  this  appendix  the  generalised  iterative  scaling 

algorithm  as  given  by  Darroch  and  Ratcliff  [5].  Let  I  be  a 

finite  set  and  let  p=  {p (i) ; i€l ,p (i) >0 ,  7  p(i)  =  l}  be  a  probability 

iEl 

function  on  I.  Suppose  that  p  is  a  member  of  a  family  of  distri¬ 
butions  satisfying  the  constraints 


7  b  p(i)  =  k  ,s  =  1,2, ...d  7  p(i)  =  1 
i€I  si  s  i&I 


(A.l) 


where  for  all  s  there  exist  iEI  such  that  b  .^0.  The  constraints 

Si 

in  (A.l)  may  be  reformulated  into  the  equivalent  canonical  form 


I  ■  h  ,r  =  1,2, ...c 

i€I 

ari=° •  ^  ari  “  1'h^>0'  ^  hr  =1 
ri  r=l  r  r=l  r 


(A. 2) 


by  defining 


asi  -  tB(us+bsi)'  a11  L> 
hs  =  ts(us+ks)  '  9sl'2',,,,d 


(A. 3) 


where  u  >0 ,  t  >0  are  chosen  to  make 
s  s 


a  .>0  and  7  a  .  <1  for  all  i£I. 

ol=  L _  g Ik 


If  7  a  .=1  for  all  i  define  c=d,  otherwise  define  c=d+l  and 
s=l  31 


d  d 

:i  =  1  ^  asi'  hc  =  1  "  I  hs* 

s=l  s=l 


let 
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Now  let  tt=  { tt (i)  t  i€I,ir(i)>0,  Y  Tr(i)^l)  be  a  subprobability 

i€I 

function  on  I.  The  minimum  discrimination  information 
estimate  p*(i),  i€I,  is  that  member  of  the  family  p  satisfy¬ 
ing  the  restraints  (A. 2)  and  minimizing 


■  Jrp(1) 


(A. 4) 


and  is  given  by 

=  ^  a  -Tr  (A*5) 

11  V-L'  r=l  n  r 

where  the  xr  are  parameters  to  be  determined  so  that  p*(i) 


satisfies  the  constraints  (A. 2).  The  values  of  p* (i)  may 


be  determined  by  the  convergent  iteration 

a 


(n+1)  (n)  c 

P  (i)  =  p(i)  Jl 
r=l 


n 


\  r 


n— 0 f X  f 2  f  •  •  • 


(A. 6) 


p(0U)  =  Tr(i) ,  h[n)  = 


J.' 


ri* 


<nU>. 
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Table  1 

Data  on  number  of  mothers  with  previous 
infant  losses  (Cochran  [4]) 

No.  of  mothers  with 


Birth 

order 


j=l 

j= 

Losses 

NO  1 

Problems 

20 

82 

Controls 

10 

54 

Problems 

Controls 

26 

16 

41 

30 

Problems 

~T7 

22 

Controls 

14 

23 

Table  2 

Maximum-likelihood  estimates  from  Berkson  [1],  Grizzle  [8] 


Case  1 


Case  2 


Case  3 


Case  4a  Case  4b 


:)  fi  I*] 


rj'JV'lrJ.I 


r  /  pr  :TiH 


14.787 

15.716 


18.188 

16.188 


ii'Kyu 


12.471 

7.604 


16 

6.465 


13.618 

15.191 


X 

X2 

n 

X* 

xd 

20 

20.503 

17.812 

29.925 

27.535 

21.191 

26 

27.213 

23.812 

22.471 

26 

23.618 

27 

25.284 

24.812 

20.604 

19.465 

28.191 

41 

39.787 

43.188 

44.529 

41 

43.382 

22 

23.716 

24.188 

28.396 

29.535 

20.809 

Ti 


4605  ...  ...  -1.4344 


1  -  ■ 


Table  6 


a~ (i jk)  0  0 


112 

113 

121 

122 

123 

211 

212 

213 

221 

222 

1/2 

0 

0 

0 

0 

1 

1/2 

0 

0 

0 

0 

0 

1 

1/2 

0 

0 

0 

0 

1 

1/2 

1/2 

1 

0 

1/2 

1 

0 

1/2 

1 

0 

1/2 

h1-x(lH,+|.x(112)+x(211)+^x(212)-x(.ll)+^x(.12) 

h2-x(121)+jx(122)+x(221)+^x(222)=x(.21)+|x(.22) 

h3-yx(112)+x(113)+jx(122)+x(123)+jx(212)+x(213)+yx(222) 


+x(223)-x(.  .3)+ix(.  .2) 
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